Expanding Queries Through Word Sense Disambiguation
Abstract
Used appropriately, semantic information can improve both precision and recall in Information Retrieval (IR) systems. This assumption is the starting point for the work carried out by the MIRACLE research team at ImageCLEF 2006. To test it, the team implemented the specification marks Word Sense Disambiguation (WSD) method [4]. The method relies on WordNet [2] and tries to select the correct sense of each word in the query, so that semantic expansion adds only the synonyms of that sense. This selective expansion is combined with a deeper linguistic analysis that interprets negations and filters out the common phrases and expressions used in query captions. Results of applying these techniques to the image retrieval task in CLEF 2006 are also included.
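The idea of selective expansion can be illustrated with a minimal sketch. It is not the specification marks method from [4]: it swaps in a much simpler gloss-overlap heuristic (in the spirit of Lesk) to pick a sense, and it uses a tiny hand-written sense inventory in place of WordNet. The names `SENSES`, `STOPWORDS`, `tokenize`, `pick_sense`, and `expand_query` are all hypothetical and exist only for this example.

```python
# Hypothetical toy sense inventory standing in for WordNet (not real WordNet data).
SENSES = {
    "bank": [
        {"gloss": "financial institution that accepts deposits and lends money",
         "synonyms": ["depository", "lender"]},
        {"gloss": "sloping land beside a river or lake",
         "synonyms": ["riverbank", "shore"]},
    ],
}

# Small illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "the", "of", "or", "and", "that", "beside", "on", "in"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [t for t in text.lower().split() if t not in STOPWORDS]


def pick_sense(word, context_tokens):
    """Pick the sense whose gloss shares the most tokens with the query context.

    This is a simplified gloss-overlap heuristic, not the specification
    marks method. Ties fall back to the first listed sense.
    """
    senses = SENSES.get(word)
    if not senses:
        return None
    context = set(context_tokens) - {word}
    return max(senses, key=lambda s: len(context & set(tokenize(s["gloss"]))))


def expand_query(query):
    """Append only the synonyms of the selected sense of each query word."""
    tokens = tokenize(query)
    expanded = list(tokens)
    for word in tokens:
        sense = pick_sense(word, tokens)
        if sense:
            expanded.extend(s for s in sense["synonyms"] if s not in expanded)
    return expanded


if __name__ == "__main__":
    print(expand_query("boats on the river bank"))
    print(expand_query("bank that lends money"))
```

In this sketch, the query about a river is expanded with the synonyms of the riverside sense only, while the query about lending receives the financial synonyms. A non-selective expansion would add both sets to both queries, which is the loss of precision the abstract aims to avoid.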
Similar resources
Word Sense Disambiguation for Cross-Language Information Retrieval
We have developed a word sense disambiguation algorithm, following Cheng and Wilensky (1997), to disambiguate among WordNet synsets. This algorithm is to be used in a cross-language information retrieval system, CINDOR, which indexes queries and documents in a language-neutral concept representation based on WordNet synsets. Our goal is to improve retrieval precision through word sense disambig...
Automatic query expansion and word sense disambiguation with long and short queries using WordNet under vector model
This paper describes the experimentation conducted to test the effectiveness of automatic query expansion and word sense disambiguation (WSD) using short and long query of a topic TREC under vector model. We ran different experiments generating queries under vector model using linguistic information extracted from WordNet. Results show that query expansion with short queries and long queries is...
On the Importance of Word Sense Disambiguation for Information Retrieval
Research in information retrieval has led to mixed results about the impact of natural language processing. This paper discusses the importance of word sense disambiguation despite these mixed results. We first discuss some of the factors that can cause apparent inconsistency in retrieval performance with regard to natural language processing: instability of test collection queries, different b...
Topic Level Disambiguation for Weak Queries
Despite limited success, today’s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queri...
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-language Information Retrieval
One of the aims of EuroWordNet (EWN) was to provide a resource for Cross-Language Information Retrieval (CLIR). In this paper we present experiments which test the usefulness of EWN for this purpose via a formal evaluation using the Spanish queries from the TREC6 CLIR test set. All CLIR systems using bilingual dictionaries must find a way of dealing with multiple translations and we employ a WS...
Publication date: 2006